To investigate the concurrent validity of the
National Teacher Examinations, test scores of over 31,000 candidates
were correlated with self-reported Grade Point Averages (GPA's). The
overall correlation between the Weighted Common Examination Total
(WCET) and GPA was .37. Validity indices for the Area Examinations
ranged from .08 to .50 with a median of .33. Using 18 selected
institutions, the correlation of their mean WCET scores within five
GPA levels and GPA level was .70. The latter result suggests overall
correlations behave more like lower bound estimates. The WCET and
most Area Examinations were concluded to have at least moderate
concurrent validity. (Author)

One of the major problems in an empirical study of test validity
is that of obtaining accurate criterion data. In many instances one must
rely on slow and costly questionnaires and be contented with whatever
responses are received. The National Teacher Examinations (NTE) program
at Educational Testing Service (ETS) has attempted to deal effectively
with this problem in efforts to conduct empirical studies relating to the
quality of its tests. Starting with the November, 1973 administration of
the NTE, research information has been collected directly from candidates
by means of a series of questions printed on the registration form. The
questions are both biographical and educational, and candidates are
assured that their responses will not affect test scores. Subsequently,
the responses can be matched with candidate scores (for research purposes
only) and analyzed so as to study certain technical aspects of the tests
or program.

The present study developed out of an initial investigation of the
correlations between the candidate background data and performance on the
NTE. The results revealed a positive correlation between candidate
self-reported grade point average and NTE performance. This paper
represents a detailed description of that finding. Although this study
yields substantial information pertaining to the concurrent validity of
the NTE, it was not initially designed as a definitive study of the NTE's
concurrent validity.

PURPOSE:

The purpose of the National Teacher Examinations is to objectively
assess the academic preparation of college seniors. The Common
Examinations provide an appraisal of a prospective teacher's basic
professional preparation and of representative aspects of general
educational studies. The Professional Education Test (110 items) measures
achievement in three dimensions of professional education: psychological
foundations; societal foundations; and teaching principles and practices.
The three General Education Tests are: Written English Expression
([illegible] items); Social Studies, Literature, and the Fine Arts
([illegible] items); Science and Mathematics (50 items). The Weighted
Common Examination Total (WCET) is a sum of scores on the four tests
weighted, respectively, [weights illegible]. The twenty-eight Area
Examinations aid in evaluating [illegible] preparation to teach or
practice in their chosen fields.

The content validity of the NTE is discussed in detail in National
Teacher Examinations Technical Handbook (ETS, 1973) which contains references
to related empirical studies. The purpose of this study, however, is to estimate
the concurrent validity of the NTE using self-reported cumulative undergraduate
grade point average (GPA) levels (or ranges) as the criterion measures and
Pearson product-moment correlation coefficients between the WCET and GPA level,
and between an Area Examination score and GPA level, as indices of concurrent
validity. The study was conducted under the assumption that grade point
averages reflect academic success, both in the professional and general
components of teacher education curricula and in the various areas of teaching
specialization.
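
As an illustration of the validity index described above, the following is a minimal sketch of a Pearson product-moment correlation between a GPA level code and a test score. The function name `pearson_r` and the six candidate records are hypothetical and are not taken from the study; the study's own computing procedure is not described in the source.

```python
import math


def pearson_r(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sums of cross-products and squares of deviations from the means.
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined when a variable is constant")
    return sxy / math.sqrt(sxx * syy)


# Hypothetical candidates (illustrative values only, not study data):
# self-reported GPA level code and WCET scaled score.
gpa_level = [2, 3, 3, 4, 4, 5]
wcet = [540, 520, 600, 580, 650, 640]

validity_index = pearson_r(gpa_level, wcet)
print(f"concurrent validity index (r) = {validity_index:.2f}")
```

Because GPA is recorded only as a small number of ordered levels, the criterion here is a coarse ordinal code treated as an interval variable, which is the same simplification the study's description implies.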

A number of subscores are normally computed for each candidate
taking the NTE; however, this study focused on just those scores (WCET and
Area Examination score) upon which designated receivers of the scores typically
make decisions and interpretations. (The NTE subscores are substantially
correlated with the WCET; see the NTE Technical Manual.)

SAMPLE:

The sample for the present study consists of all candidates taking
the NTE in November 1973 and January 1974 (over 30,000 candidates). The
number of subjects, ranging from 25 to 10,036, varied from one result to
another depending on several factors, such as the particular Area Examination
taken and the availability of self-reported candidate information.

It should be noted that the test-taking population of the NTE is
not geographically representative of all persons entering the teaching
profession. Approximately 75% of the NTE candidates are from the South
Central, South and Middle Atlantic, and the New England States. There was,
however, no reason to suspect a priori that there would be systematic
differences in the relationship of NTE scores to GPA level between beginning
teachers from the eastern part of the United States and other beginning
teachers. No such investigation was subsequently made nor deemed necessary.

Correlations were obtained between self-reported GPA and both the
WCET and the Area Examinations. The correlations were separately computed
by area and for all candidates. In addition, for each area, the correlation
between the WCET and Area Examination score was obtained using all candidates
with both scores available.

Ordinarily, one would expect that a grade point average is at least
in part a function of the institution attended. Thus, as the basis for another
correlation in the study, scores from a sample of educational institutions
were drawn. It was judged that the institutions should be reasonably large
in order to promote stability of the data and integrated to reduce the chance
of racially biased data (approximately 20% of the NTE candidates are Black).
Hence, the candidate score files for the pooled administrations were searched
so as to select all scores from any institution for which
there were at least 20 Black and 20 White candidates who designated the
institution as their undergraduate school. Eighteen such institutions were
detected. The WCET scores for all students who were members of the eighteen
selected schools were processed so as to produce the mean WCET for each
level of GPA within each of the selected institutions. These derived means
were then used as data to compute a correlation with GPA. The effect of
this procedure was to observe the relationship between the self-reported
GPA level and the WCET, allowing only institutional variation to operate
within GPA level.
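
The aggregation step described above can be sketched as follows: individual scores are grouped into institution-by-GPA-level cells, each cell is reduced to its mean WCET, and the cell means are then correlated with GPA level. This is a minimal sketch under stated assumptions; the record layout, the function names `cell_means` and `pearson_r`, and the eight sample records are hypothetical, and the rule for pooling sparse cells with an adjoining level is omitted.

```python
import math
from collections import defaultdict


def pearson_r(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def cell_means(records):
    """Reduce (institution, gpa_level, wcet) records to one
    (gpa_level, mean_wcet) point per institution-by-level cell."""
    cells = defaultdict(list)
    for institution, level, score in records:
        cells[(institution, level)].append(score)
    return [(level, sum(scores) / len(scores))
            for (_, level), scores in sorted(cells.items())]


# Hypothetical records (illustrative values only, not study data).
records = [
    ("A", 3, 520), ("A", 3, 560), ("A", 4, 600), ("A", 5, 640),
    ("B", 3, 580), ("B", 4, 610), ("B", 4, 650), ("B", 5, 700),
]

points = cell_means(records)
levels = [level for level, _ in points]
means = [mean for _, mean in points]
r_means = pearson_r(levels, means)
print(f"{len(points)} cell means, r = {r_means:.2f}")
```

Each cell contributes one unweighted point regardless of how many candidates it contains, so large and small cells count equally in the resulting coefficient.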



RESULTS:

Table I reports means, standard deviations, and concurrent validity
indices for 24 of the 28 Area Examinations administered nationally. Data
for the tests in German, Introduction to the Teaching of Reading, Texas
Government, and Audiology are not reported since there were fewer than
twenty candidates available for those areas. The validity indices ranged
from .08 to .50 with a median index of about .33 or .34. It is not clear
why some tests have a relatively low index or, conversely, a relatively
high index. Areas such as Guidance Counselor, Educational Administration,
and Reading Specialist could be expected to have lower indices because they
are reflections of graduate programs in which, typically, candidates have
high undergraduate GPA's which do not vary nearly as much as those of
candidates who do not enter graduate programs; hence, the validity indices
tend to be lower. This explanation does not, however, account for low
indices for the tests in Men's Physical Education, Chemistry, Physics, and
General Science, or Education in an Urban Setting. It is similarly puzzling
why some tests produce a high index.
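
The restricted-variability explanation offered above can be made concrete with the standard formula for direct range restriction, which assumes a linear relationship with constant residual variance. The sketch below is not a computation from the study; the function name `restricted_r` and the ratio of standard deviations are hypothetical, and only the starting value of .37 comes from the text.

```python
import math


def restricted_r(r_full, sd_ratio):
    """Expected correlation after direct selection on one variable.

    r_full   -- correlation in the unrestricted group
    sd_ratio -- restricted SD divided by unrestricted SD of the
                selected variable (here, GPA)
    """
    if not -1.0 <= r_full <= 1.0:
        raise ValueError("r_full must lie in [-1, 1]")
    if sd_ratio <= 0:
        raise ValueError("sd_ratio must be positive")
    numerator = r_full * sd_ratio
    return numerator / math.sqrt(1.0 - r_full ** 2 + numerator ** 2)


# Starting from the overall WCET-GPA correlation of .37, suppose
# (hypothetically) that GPA varies only half as much in a graduate area.
print(round(restricted_r(0.37, 0.5), 2))
```

Under these assumptions, halving the spread of GPA roughly halves the observed index, which is the direction of effect the paragraph proposes for the graduate-level areas.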

Insert Table I about here

Correlations between GPA level and WCET by area are about the same
magnitude as the validity indices for the Area Examinations. These are shown
separately in Table I and have a range of .16 to .47 with a median correlation
of about .36. The correlation between GPA level and WCET using all students
regardless of their teaching area was computed to be .367 and is consistent
with the median of the correlations reported separately by area. This
relationship is displayed in Figure 1, which shows selected percentiles of the
distribution of WCET by GPA level.

It is important to keep in mind that the relationship depicted in
Figure 1 is a result which ignores the variation in GPA from one institution
to another; for example, a 3.0 GPA in one school does not necessarily indicate
the same level of ability or achievement as a 3.0 in another school. Despite
this possible confounding, there are differences ranging from 23 to 47 scale
score points between the means at successive self-reported GPA levels.



Insert Figure 1 about here 



Correlations between the WCET and the Area Examinations are reported
in Table II. These correlations are fairly substantial, ranging from .40 to
.90 with a median coefficient of about .84. These results suggest a considerable
amount of overlap between the two scores. It is reasonable, however, to
expect that students who do well in their specialties will do well generally
and conversely. Nevertheless, an inspection of Table II shows that the
intercorrelations are consistently lower than the Area Examination reliabilities.
This result implies that some specific variance remains in one or both scores
that may offer additional information unrelated to grades.
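
The comparison in the preceding paragraph can be checked directly against the Table II values. The following sketch transcribes the correlation and reliability columns of Table II and confirms the stated range, the median, and the claim that every intercorrelation falls below the corresponding Area Examination reliability. The variable names are illustrative, and the check is a simple inequality rather than a formal analysis of specific variance.

```python
from statistics import median

# (area, correlation with WCET, Area Examination reliability), from Table II.
table_ii = [
    ("Art Education", 0.78, 0.92),
    ("Biology and General Science", 0.85, 0.95),
    ("Business Education", 0.86, 0.91),
    ("Chemistry, Physics and General Science", 0.79, 0.94),
    ("Early Childhood Education", 0.89, 0.90),
    ("Education in the Elementary School", 0.88, 0.92),
    ("Education of the Mentally Retarded", 0.85, 0.89),
    ("Education in an Urban Setting", 0.85, 0.92),
    ("Educational Admin. & Supervision", 0.79, 0.89),
    ("English Language and Literature", 0.90, 0.94),
    ("French", 0.65, 0.94),
    ("Guidance Counselor", 0.81, 0.92),
    ("Home Economics Education", 0.86, 0.90),
    ("Industrial Arts Education", 0.87, 0.92),
    ("Mathematics", 0.71, 0.93),
    ("Media Specialist -- Library & A/V", 0.84, 0.94),
    ("Men's Physical Education", 0.83, 0.87),
    ("Music Education", 0.83, 0.91),
    ("Reading Specialist -- Elem. School", 0.84, 0.90),
    ("Social Studies", 0.89, 0.95),
    ("Spanish", 0.40, 0.94),
    ("Speech--Communication & Theatre", 0.79, 0.82),
    ("Speech Pathology", 0.72, 0.94),
    ("Women's Physical Education", 0.88, 0.91),
]

correlations = [r for _, r, _ in table_ii]
all_lower = all(r < rel for _, r, rel in table_ii)
median_r = median(correlations)
# Area where the correlation comes closest to the reliability.
smallest_gap = min(table_ii, key=lambda row: row[2] - row[1])

print(min(correlations), max(correlations), round(median_r, 2), all_lower)
print("smallest gap:", smallest_gap[0])
```

The gap between correlation and reliability is narrowest for Early Childhood Education (.89 against .90) and widest for Spanish (.40 against .94), so the amount of score variance not shared with the WCET differs considerably from one area to another.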



Insert Table II about here

TABLE II

RELATIONSHIPS BETWEEN THE WEIGHTED COMMON EXAMINATION TOTAL AND
AREA EXAMINATIONS

                                          Correlation Between    Reliability*
                                          WCET and Area          of Area
Teaching or Professional Area             Examination Score      Examination

Art Education                             .78  (  667)**         .92
Biology and General Science               .85  (  633)           .95
Business Education                        .86  (  981)           .91
Chemistry, Physics and General Science    .79  (  179)           .94
Early Childhood Education                 .89  (3,670)           .90
Education in the Elementary School        .88  (8,812)           .92
Education of the Mentally Retarded        .85  (1,266)           .89
Education in an Urban Setting             .85  (   25)           .92
Educational Admin. & Supervision          .79  (  147)           .89
English Language and Literature           .90  (1,876)           .94
French                                    .65  (  174)           .94
Guidance Counselor                        .81  (   77)           .92
Home Economics Education                  .86  (  774)           .90
Industrial Arts Education                 .87  (  260)           .92
Mathematics                               .71  ([illegible])     .93
Media Specialist -- Library & A/V         .84  (  171)           .94
Men's Physical Education                  .83  (1,130)           .87
Music Education                           .83  (  778)           .91
Reading Specialist -- Elem. School        .84  (   69)           .90
Social Studies                            .89  (2,286)           .95
Spanish                                   .40  (  233)           .94
Speech--Communication & Theatre           .79  (  205)           .82
Speech Pathology                          .72  (  240)           .94
Women's Physical Education                .88  (  885)           .91

*  From formal test analyses published at ETS.
** Number of candidates in parentheses.

Results of the separate correlation using eighteen selected schools
are shown in Figure 2. The points within each GPA level represent the
mean WCET at those schools for all candidates whose self-reported GPA's were
at that level. The mean of those points is reported above each level designation
along with the total number of candidates at the GPA level in all eighteen
schools. A difference can be observed between the means at adjacent levels;
moreover, the differences are quite marked for the high GPA levels. Using all
seventy mean WCET points in the figure and corresponding GPA level as paired
data, the correlation between them was found to be .70. This coefficient is
much higher than that reported in Figure 1 because averaging within each
school and GPA level removes the variation among individual students.

The appreciable variation among the institutional WCET means at
fixed GPA levels suggests that if sufficient data were available within
individual institutions, the concurrent validities for the WCET would be
higher than those shown in Table I. This paper takes the position that the
concurrent validities reported in Table I behave more like lower bound estimates
of concurrent validity. In other words, typical estimates of concurrent
validity would more than likely be higher than those reported in Table I
if the data were gathered from within an individual school.
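
The lower-bound argument can be illustrated with a small constructed example. In the sketch below, two hypothetical schools each show a strong relationship between GPA level and test score, but one school grades more strictly, so its candidates score higher at every GPA level. Pooling the two schools then yields a noticeably lower correlation than either school shows alone. All values and names are invented for illustration and are not study data.

```python
import math


def pearson_r(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def r_for(pairs):
    """Correlation between GPA level and score for (level, score) pairs."""
    return pearson_r([level for level, _ in pairs],
                     [score for _, score in pairs])


# Hypothetical (gpa_level, wcet) pairs. School B grades more strictly,
# so the same GPA level corresponds to a higher test score there.
school_a = [(2, 500), (3, 540), (4, 570), (5, 620)]
school_b = [(2, 600), (3, 630), (4, 680), (5, 710)]

r_a = r_for(school_a)
r_b = r_for(school_b)
r_pooled = r_for(school_a + school_b)

print(f"within A: {r_a:.2f}, within B: {r_b:.2f}, pooled: {r_pooled:.2f}")
```

The between-school difference in grading standards adds score variance that GPA level cannot account for, which is why the pooled coefficient understates the within-school relationship in this constructed case.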



Insert Figure 2 about here 



SUMMARY AND CONCLUSIONS:

Using more than 30,000 candidates, the WCET of the National Teacher
Examination Program had an overall correlation of .37 with self-reported GPA
level. Validity indices for the Area Examinations ranged from .08 to .50
with a median index of .33 using all candidates within each area during the
November 1973 and January 1974 administrations.

In a separate correlation using institutional WCET means and GPA
levels, the index of relationship was found to be .70.

An interpretation was made that the overall indices confounding
institutional differences were more like estimates of the lower bound of
concurrent validity.

The results discussed here give a reasonable indication that the
WCET score and most Area Examination scores have moderate concurrent validity
using a self-reported GPA level as the criterion measure.

Although the GPA does not actually reflect the substance of one's
curricular exposure, it is a readily available index among the credentials
of a prospective teacher that is usually evaluated by a potential employer.
The variation of grading standards among institutions as depicted in Figure 2,
however, implies the need for a standard objective instrument from which one
can infer and/or corroborate both the content and standing of a candidate's
academic preparation.

[Figure 2: plot of institutional mean WCET scores (vertical axis, 400 to 700)
against self-reported GPA level (horizontal axis); r = .70]

GPA LEVEL     GPA RANGE     N         MEAN
1-2           1.5-2.49        490     551
3             2.5-2.99      1,625     556
4             3.0-3.49      1,826     593
5             3.5-4.0         665     637

FIGURE 2. WEIGHTED COMMON EXAMINATION TOTAL UNWEIGHTED MEAN
SCORES OF EIGHTEEN SELECTED SCHOOLS BY
SELF-REPORTED GPA LEVELS*

* SCORES WERE POOLED WITH ADJOINING GPA LEVEL IN THE LESS EXTREME DIRECTION
IF LESS THAN TWO CANDIDATES WERE IN A PARTICULAR GPA LEVEL